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In this note, d < n are positive integers. Let S' be a finite set of cardinal [S'! = n and let 2^ 
denote its power set, i.e. the set of its snbsets. A d-subset of S' is a snbset of S of cardinal d. 
Let C 2^ and A C S'. The trace of T on A is the family J^a = {E O A ; E E E}. One says 
that E shatters A if Ea = 2,^- The VC-dimension of E is the maximal cardinal of a subset of 
S' that is shattered hy E The following is well-known 

Theorem 1 . (Vapnik-Chervonenkis, Sauer, Shelah) 

If VC-dim{E) < d (i.e. if E shatters no {d + l)-subset of S) then \E\ < c{d,n), where 


cKn) = (;) + ... + (”). 


Moreover this bound is tight: It is achieved e.g. for E = the family of all k-subsets of S, 
0 < k < d. 


A first natural question is: 

Question 1 . Assume a family E 2^ is maximal for the inclusion among all families of 
VC-dimension at most d. Does E always have the maximal possible cardinal c{d, n) 7 

Let us define the index of E as follows: 


Ind = max{(i G {0, ...,n} ; E shatters all d-subsets of S}. 

Let C{d,n) = min{|J^| ; IndJ^ = d}. For instance, we have = 2, with the (only 

possible) choice E = {0, S}. Of course we have 2'^ < C(d.in) < 2^. The question is: 

Question 2 . Give the exact value of C{d, n) for 2 < d < n. If this is not possible, give lower 
and upper bounds as accurate as possible. 

A well-known duality yields another formulation of Question O Let (f : S ^ 2^, a i-G- {E G 
E ] a E E} and set S = y:>{S). In this manner, we have for all a G S' and all E E E: 

a E E yy E E (p{a). (1) 

One can check that E shatters A C S' if and only if, for every partition (5, C) of A (i.e. 
A = Be C and 5 0 0 = 0) the intersection ^ flfees ^ flceC nonempty, where 

the notation Y stands for E\Y. 

If Ind5 > 2, then (p is a one-to-one correspondance from S' to S, hence we have logn < 
0(d, n) for all 2 < d < n, where log denotes the logarithm in base 2. 
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The case d = 2. Using for instance the binary expansion, it is easy to show that the order 
of magnitude of C{2,n) is actually logn. The next statement rehnes this. 

Proposition 2 . Ifn = i(f) = then C{2,n) = 21. 

Proof. (Recall the notation A = \ A.) We hrst prove by contradiction that C{2, n) > 21 — 1. 

Actually, if a family fF of subsets of S shatters all 2-subsets of S', then the image 5 C 2-^ of S' 
by ip must satisfy 

VA yf B E S, An B, An B, An B, and An B are nonempty. (2) 

In particular iS is a Sperner family of fF (i.e. an antichain for the partial order of inclusion; 
one hnds several other expressions in the literature: ‘Sperner system’, ‘independent system’, 
‘clutter’, ‘completely separating system’, etc.). For a survey on Sperner families and several 
generalizations, we refer e.g. to [1] and the references therein. 

Assume now that \fF\ = 2/ — 1; it is known [6l IH [3] that all Sperner families of fF have a 
cardinal at most , and that there are only two Sperner families of maximal cardinal: the 
families and ('^), i.e. of {I — l)-subsets, resp. /-subsets of fF. However, none of these 
families satishes both An B and An B nonempty in ([2]). As a consequence, we must have 
|J^| > 21. 

Conversely, let S = {ai,...,a„}, consider gg|^ gf /-subsets of {!,..., 2/}, 

and choose one element in each pair of complementary /-subsets. We then obtain a family 
{Ai,..., An} which satishes ([2]). Now we set T = {Ei ,..., E 21 }, with Ei = {aj ; i G Aj}. The 
characterization ([T]) shows that E shatters every 2-subset of S. □ 

The proof of the following statement is straightforward. 

Corollary 3 . If < n < then 21 < C{2,n) <21 + 2. 

The upper bound can be slightly improved: One can prove that, if < n < then 

2/ < C{2,n) <21 + 1. 

Question 3 . It seems that we have C{2,n) = k if and only if ^ 

where |_a:J denotes the integer part of x. Is it true? Is it already known? 

The hrst values are (7(2,2) = (7(2,3) = 4, (7(2,4) = 5, (7(2, 5) = • • • = (7(2,10) = 6. Com¬ 
puter seems to be useless, at least for a naive treatment. Already in order to obtain (7(2,11) = 7, 
we would have to verify that (7(2,11) > 6, i.e. to hnd, for each of the (^g ) ~ 10^^ families E 
in 2“^ some 2-subset that is not shattered by the family. (Alternatively, in the dual statement, 
we have to check “only” 7.10^^ families S in 2^.) 

The case d E 3. From now, we assume n > 4. 

Proposition 4 . For all 3 < d < n, we have C{d, n) < ^ (3 logn)”*. 

The constant 3 can be improved. The proof below shows that, for all a > 1 and all n large 
enough, C{d,n) < ^ (alogn)‘‘^. 

Proof. Let Eq C 2"^ be a minimal separating system of S, i.e. such that, for all a,b E S there 
exists En E Eo which satishes b ^ E^ 3 a. Since this amounts to choosing Eo minimal such 
that S = p{S) is a Sperner family for Eo, we know that |J7)| = 77 if and only if ^ 

n < (Ljvy 2 j)’ hence N := |J7i| < 2 -|- logn -|- ^loglogn < 31ogn since n > 4. We assume 
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> 2 in the sequel. Given two disjoint subsets B and G of S' such that \B U C\ = d, the set 
= flcec (UfeeB El) contains B and does not meet C. Let B be the collection of all such 
sets then B shatters all subsets of S of cardinal at most d. 

To estimate |J^|, we consider Bk the collection of all such sets with \B\ = k (and thus 
|G| = d — k). We have \Bk\ = ('^) i^d-k) N = |JS)|). Then we choose T = lJfc=o‘^fc- 

obtain |.F| < Y.U (1) (It) = 02- (3 logn)f 

□ 

Question 4 . Is (log n) LGd L(<^+i)/2j right order of magnitude for C (d, n) ? 

By constructing auxiliary Sperner families from S, it is possible to give a better lower bound for 
G(d, n) than only G(d, n) > C{2, n). For instance, in the case d = 3, for all distinct A, B,C G S, 
we must have Ad B ^ C. One can check that this implies that the family {A Hi?; A,B & S} 
is a Sperner family, therefore we obtain (”) < ([cy 3 ^n”j/ 2 )- Unfortunately, this does not modify 
the order of magnitude. Already in this case d = 3, we do not know whether G(3, n) is of order 
logn, (logn)^, or an intermediate order of magnitude. Another formulation is: 

Question 5 . Prove or disprove: There exists C > 0 such that, for all /c G N, if fF is a hnite 
set of cardinal k and S 0 2^ satishes VA, B,C E S, Ad B C, then |iS| < C 2^^. 
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